Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 795
Filtrar
1.
Psychon Bull Rev ; 2024 Apr 02.
Artículo en Inglés | MEDLINE | ID: mdl-38565841

RESUMEN

We review the evidence for the conceptual association between arithmetic and space and quantify the effect size in meta-analyses. We focus on three effects: (a) the operational momentum effect (OME), which has been defined as participants' tendency to overestimate results of addition problems and underestimate results of subtraction problems; (b) the arithmetic cueing effect, in which arithmetic problems serve as spatial cues in target detection or temporal order judgment tasks; and (c) the associations between arithmetic and space observed with eye- and hand-tracking studies. The OME was consistently found in paradigms that provided the participants with numerical response alternatives. The OME shows a large effect size, driven by an underestimation during subtraction while addition was unbiased. In contrast, paradigms in which participants indicated their estimate by transcoding their final estimate to a spatial reference frame revealed no consistent OME. Arithmetic cueing studies show a reliable small to medium effect size, driven by a rightward bias for addition. Finally, eye- and hand-tracking studies point to replicable associations between arithmetic and eye or hand movements. To account for the complexity of the observed pattern, we introduce the Adaptive Pathways in Mental Arithmetic (APiMA) framework. The model accommodates central notions of numerical and arithmetic processing and helps identifying which pathway a given paradigm operates on. It proposes that the divergence between OME and arithmetic cueing studies comes from the predominant use of non-symbolic versus symbolic stimuli, respectively. Overall, our review and findings clearly support an association between arithmetic and spatial processing.

2.
J Learn Disabil ; : 222194241241040, 2024 Apr 09.
Artículo en Inglés | MEDLINE | ID: mdl-38591175

RESUMEN

A growing body of evidence suggests that children with dyslexia in alphabetic languages exhibit visual-spatial attention deficits that can obstruct reading acquisition by impairing their phonological decoding skills. However, it remains an open question whether these visual-spatial attention deficits are present in children with dyslexia in non-alphabetic languages. Chinese, with its logographic writing system, offers a unique opportunity to explore this question. The presence of visual-spatial attention deficits in Chinese children with dyslexia remains insufficiently investigated. Therefore, this study aimed to explore whether such deficits exist, employing a visual search paradigm. Three visual search tasks were conducted, encompassing two singleton feature search tasks and a serial conjunction search task. The results indicated that Chinese children with dyslexia performed as well as chronological age-matched control children in color search tasks but less effectively in orientation search, suggesting a difficulty in the rapid visual processing of orientation: a deficit potentially specific to Chinese dyslexia. Crucially, Chinese children with dyslexia also exhibited lower accuracy, longer reaction times, and steeper slopes in the reaction times by set size function in the conjunction search task compared to control children, which is indicative of a visual-spatial attention deficit.

4.
Exp Brain Res ; 2024 Apr 23.
Artículo en Inglés | MEDLINE | ID: mdl-38652274

RESUMEN

The ability to adapt to the environment is linked to the possibility of inhibiting inappropriate behaviours, and this ability can be enhanced by attention. Despite this premise, the scientific literature that assesses how attention can influence inhibition is still limited. This study contributes to this topic by evaluating whether spatial and moving attentional cueing can influence inhibitory control. We employed a task in which subjects viewed a vertical bar on the screen that, from a central position, moved either left or right where two circles were positioned. Subjects were asked to respond by pressing a key when the motion of the bar was interrupted close to the circle (go signal). In about 40% of the trials, following the go signal and after a variable delay, a visual target appeared in either one of the circles, requiring response inhibition (stop signal). In most of the trials the stop signal appeared on the same side as the go signal (valid condition), while in the others, it appeared on the opposite side (invalid condition). We found that spatial and moving cueing facilitates inhibitory control in the valid condition. This facilitation was observed especially for stop signals that appeared within 250ms of the presentation of the go signal, thus suggesting an involvement of exogenous attentional orienting. This work demonstrates that spatial and moving cueing can influence inhibitory control, providing a contribution to the investigation of the relationship between spatial attention and inhibitory control.

5.
Foods ; 13(7)2024 Mar 29.
Artículo en Inglés | MEDLINE | ID: mdl-38611366

RESUMEN

Green fruit detection is of great significance for estimating orchard yield and the allocation of water and fertilizer. However, due to the similar colors of green fruit and the background of images, the complexity of backgrounds and the difficulty in collecting green fruit datasets, there is currently no accurate and convenient green fruit detection method available for small datasets. The YOLO object detection model, a representative of the single-stage detection framework, has the advantages of a flexible structure, fast inference speed and excellent versatility. In this study, we proposed a model based on the improved YOLOv5 model that combined data augmentation methods to detect green fruit in a small dataset with a background of similar color. In the improved YOLOv5 model (YOLOv5-AT), a Conv-AT block and SA and CA blocks were designed to construct feature information from different perspectives and improve the accuracy by conveying local key information to the deeper layer. The proposed method was applied to green oranges, green tomatoes and green persimmons, and the mAPs were higher than those of other YOLO object detection models, reaching 84.6%, 98.0% and 85.1%, respectively. Furthermore, taking green oranges as an example, a mAP of 82.2% was obtained on the basis of retaining 50% of the original dataset (163 images), which was only 2.4% lower than that obtained when using 100% of the dataset (326 images) for training. Thus, the YOLOv5-AT model combined with data augmentation methods can effectively achieve accurate detection in small green fruit datasets under a similar color background. These research results could provide supportive data for improving the efficiency of agricultural production.

6.
Sci Rep ; 14(1): 8886, 2024 Apr 17.
Artículo en Inglés | MEDLINE | ID: mdl-38632476

RESUMEN

As one of the three major outdoor components of the railroad signal system, the track circuit plays an important role in ensuring the safety and efficiency of train operation. Therefore, when a fault occurs, the cause of the fault needs to be found quickly and accurately and dealt with in a timely manner to avoid affecting the efficiency of train operation and the occurrence of safety accidents. This article proposes a fault diagnosis method based on multi-scale attention network, which uses Gramian Angular Field (GAF) to transform one-dimensional time series into two-dimensional images, making full use of the advantages of convolutional networks in processing image data. A new feature fusion training structure is designed to effectively train the model, fully extract features at different scales, and fusing spatial feature information through spatial attention mechanisms. Finally, experiments are conducted using real track circuit fault datasets, and the accuracy of fault diagnosis reaches 99.36%, and our model demonstrates better performance compared to classical and state-of-the-art models. And the ablation experiments verified that each module in the designed model plays a key role.

7.
J Xray Sci Technol ; 2024 Mar 27.
Artículo en Inglés | MEDLINE | ID: mdl-38552134

RESUMEN

Highlights: • Introduce a data augmentation strategy to expand the required different morphological data during the training and learning phase, and improve the algorithm's feature learning ability for complex and diverse tumor morphology CT images.• Design attention mechanisms for encoding and decoding paths to extract fine pixel level features, improve feature extraction capabilities, and achieve efficient spatial channel feature fusion.• The deep supervision layer is used to correct and decode the final image data to provide high accuracy of results.• The effectiveness of this method has been affirmed through validation on the LITS, 3DIRCADb, and SLIVER datasets. BACKGROUND: Accurately extracting liver and liver tumors from medical images is an important step in lesion localization and diagnosis, surgical planning, and postoperative monitoring. However, the limited number of radiation therapists and a great number of images make this work time-consuming. OBJECTIVE: This study designs a spatial attention deep supervised network (SADSNet) for simultaneous automatic segmentation of liver and tumors. METHOD: Firstly, self-designed spatial attention modules are introduced at each layer of the encoder and decoder to extract image features at different scales and resolutions, helping the model better capture liver tumors and fine structures. The designed spatial attention module is implemented through two gate signals related to liver and tumors, as well as changing the size of convolutional kernels; Secondly, deep supervision is added behind the three layers of the decoder to assist the backbone network in feature learning and improve gradient propagation, enhancing robustness. RESULTS: The method was testing on LITS, 3DIRCADb, and SLIVER datasets. For the liver, it obtained dice similarity coefficients of 97.03%, 96.11%, and 97.40%, surface dice of 81.98%, 82.53%, and 86.29%, 95% hausdorff distances of 8.96 mm, 8.26 mm, and 3.79 mm, and average surface distances of 1.54 mm, 1.19 mm, and 0.81 mm. Additionally, it also achieved precise tumor segmentation, which with dice scores of 87.81% and 87.50%, surface dice of 89.63% and 84.26%, 95% hausdorff distance of 12.96 mm and 16.55 mm, and average surface distances of 1.11 mm and 3.04 mm on LITS and 3DIRCADb, respectively. CONCLUSION: The experimental results show that the proposed method is effective and superior to some other methods. Therefore, this method can provide technical support for liver and liver tumor segmentation in clinical practice.

8.
Exp Brain Res ; 2024 Mar 31.
Artículo en Inglés | MEDLINE | ID: mdl-38555556

RESUMEN

Healthy individuals typically show more attention to the left than to the right (known as pseudoneglect), and to the upper than to the lower visual field (known as altitudinal pseudoneglect). These biases are thought to reflect asymmetries in neural processes. Attention biases have been used to investigate how these neural asymmetries change with age. However, inconsistent results have been reported regarding the presence and direction of age-related effects on horizontal and vertical attention biases. The observed inconsistencies may be due to insensitive measures and small sample sizes, that usually only feature extreme age groups. We investigated whether spatial attention biases, as indexed by gaze position during free viewing of a single image, are influenced by age. We analysed free-viewing data from 4,243 participants aged 5-65 years and found that attention biases shifted to the right and superior directions with increasing age. These findings are consistent with the idea of developing cerebral asymmetries with age and support the hypothesis of the origin of the leftward bias. Age modulations were found only for the first seven fixations, corresponding to the time window in which an absolute leftward bias in free viewing was previously observed. We interpret this as evidence that the horizontal and vertical attention biases are primarily present when orienting attention to a novel stimulus - and that age modulations of attention orienting are not global modulations of spatial attention. Taken together, our results suggest that attention orienting may be modulated by age and that cortical asymmetries may change with age.

9.
Spectrochim Acta A Mol Biomol Spectrosc ; 313: 124166, 2024 May 15.
Artículo en Inglés | MEDLINE | ID: mdl-38493512

RESUMEN

Rapid, effective and non-destructive detection of the defective maize kernels is crucial for their high-quality storage in granary. Hyperspectral imaging (HSI) coupled with convolutional neural network (CNN) based on spectral and spatial attention (Spl-Spal-At) module was proposed for identifying the different types of maize kernels. The HSI data within 380-1000 nm of six classes of sprouted, heat-damaged, insect-damaged, moldy, broken and healthy kernels was collected. The CNN-Spl-At, CNN-Spal-At and CNN-Spl-Spal-At models were established based on the spectra, images and their fusion features as inputs for the recognition of different kernels. Further compared the performances of proposed models and conventional models were built by support vector machine (SVM) and extreme learning machine (ELM). The results indicated that the recognition ability of CNN with attention series models was significantly better than that of SVM and ELM models and fused features were more conducive to expressing the appearance of different kernels than single features. And the CNN-Spl-Spal-At model had an optimal recognition result with high average classification accuracy of 98.04 % and 94.56 % for the training and testing sets, respectively. The recognition results were visually presented on the surface image of kernels with different colors. The CNN-Spl-Spal-At model was built in this study could effectively detect defective maize kernels, and it also had great potential to provide the analysis approaches for the development of non-destructive testing equipment based on HSI technique for maize quality.


Asunto(s)
Imágenes Hiperespectrales , Zea mays , Calor , Redes Neurales de la Computación , Máquina de Vectores de Soporte
10.
Comput Biol Med ; 172: 108265, 2024 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-38461698

RESUMEN

Convolution operation is performed within a local window of the input image. Therefore, convolutional neural network (CNN) is skilled in obtaining local information. Meanwhile, the self-attention (SA) mechanism extracts features by calculating the correlation between tokens from all positions in the image, which has advantage in obtaining global information. Therefore, the two modules can complement each other to improve feature extraction ability. An effective fusion method is a problem worthy of further study. In this paper, we propose a CNN and SA paralleling network CSAP-UNet with U-Net as backbone. The encoder consists of two parallel branches of CNN and Transformer to extract the feature from the input image, which takes into account both the global dependencies and the local information. Because medical images come from certain frequency bands within the spectrum, their color channels are not as uniform as natural images. Meanwhile, medical segmentation pays more attention to lesion regions in the image. Attention fusion module (AFM) integrates channel attention and spatial attention in series to fuse the output features of the two branches. The medical image segmentation task is essentially to locate the boundary of the object in the image. The boundary enhancement module (BEM) is designed in the shallow layer of the proposed network to focus more specifically on pixel-level edge details. Experimental results on three public datasets validate that CSAP-UNet outperforms state-of-the-art networks, particularly on the ISIC 2017 dataset. The cross-dataset evaluation on Kvasir and CVC-ClinicDB shows that CSAP-UNet has strong generalization ability. Ablation experiments also indicate the effectiveness of the designed modules. The code for training and test is available at https://github.com/zhouzhou1201/CSAP-UNet.git.


Asunto(s)
Procesamiento de Imagen Asistido por Computador , Redes Neurales de la Computación
11.
Neuropsychologia ; 196: 108848, 2024 Apr 15.
Artículo en Inglés | MEDLINE | ID: mdl-38432323

RESUMEN

This study aimed to investigate whether neurological patients presenting with a bias in line bisection show specific problems in bisecting a line into two equal parts or their line bisection bias rather reflects a special case of a deficit in proportional reasoning more generally. In the latter case, the bias should also be observed for segmentations into thirds or quarters. To address this question, six neglect patients with a line bisection bias were administered additional tasks involving horizontal lines (e.g., segmentation into thirds and quarters, number line estimation, etc.). Their performance was compared to five neglect patients without a line bisection bias, 10 patients with right hemispheric lesions without neglect, and 32 healthy controls. Most interestingly, results indicated that neglect patients with a line bisection bias also overestimated segments on the left of the line (e.g., one third, one quarter) when dissecting lines into parts smaller than halves. In contrast, such segmentation biases were more nuanced when the required line segmentation was framed as a number line estimation task with either fractions or whole numbers. Taken together, this suggests a generalization of line bisection bias towards a segmentation or proportional processing bias, which is congruent with attentional weighting accounts of line bisection/neglect. As such, patients with a line bisection bias do not seem to have specific problems bisecting a line, but seem to suffer from a more general deficit processing proportions.


Asunto(s)
Lateralidad Funcional , Trastornos de la Percepción , Humanos , Trastornos de la Percepción/etiología , Atención , Sesgo , Generalización Psicológica , Percepción Espacial
12.
bioRxiv ; 2024 Mar 07.
Artículo en Inglés | MEDLINE | ID: mdl-38496524

RESUMEN

Attention enables us to efficiently and flexibly interact with the environment by prioritizing some image features, such as location or orientation, even before stimulus onset. We investigated how covert spatial attention affects responses in human visual cortex prior to target onset and how it affects behavioral performance after target onset, using a concurrent psychophysics-fMRI experiment. Performance improved at cued locations and worsened at uncued locations, relative to distributed attention. BOLD responses in cortical visual field maps changed in two ways: First, there was a stimulus-independent baseline shift, positive in map locations near the cued location and negative elsewhere. Second, population receptive field centers shifted toward the attended location. Both effects increased in higher visual areas. Together, the results show that spatial attention has large effects on visual cortex prior to target appearance, altering neural response properties across the entirety of multiple visual field maps.

13.
Neurosci Biobehav Rev ; 160: 105622, 2024 May.
Artículo en Inglés | MEDLINE | ID: mdl-38490498

RESUMEN

The present review examined the consequences of focal brain injury on spatial attention studied with cueing paradigms, with a particular focus on the disengagement deficit, which refers to the abnormal slowing of reactions following an ipsilesional cue. Our review supports the established notion that the disengagement deficit is a functional marker of spatial neglect and is particularly pronounced when elicited by peripheral cues. Recent research has revealed that this deficit critically depends on cues that have task-relevant characteristics or are associated with negative reinforcement. Attentional capture by task-relevant cues is contingent on damage to the right temporo-parietal junction (TPJ) and is modulated by functional connections between the TPJ and the right insular cortex. Furthermore, damage to the dorsal premotor or prefrontal cortex (dPMC/dPFC) reduces the effect of task-relevant cues. These findings support an interactive model of the disengagement deficit, involving the right TPJ, the insula, and the dPMC/dPFC. These interconnected regions play a crucial role in regulating and adapting spatial attention to changing intrinsic values of stimuli in the environment.


Asunto(s)
Lesiones Encefálicas , Trastornos de la Percepción , Humanos , Corteza Prefrontal , Trastornos de la Percepción/etiología , Señales (Psicología) , Percepción Espacial/fisiología , Lóbulo Parietal/fisiología , Lateralidad Funcional/fisiología , Tiempo de Reacción/fisiología
14.
J Imaging Inform Med ; 2024 Mar 06.
Artículo en Inglés | MEDLINE | ID: mdl-38448760

RESUMEN

Identifying indolent and aggressive prostate cancers is a critical problem for optimal treatment. The existing approaches of prostate cancer detection are facing challenges as the techniques rely on ground truth labels with limited accuracy, and histological similarity, and do not consider the disease pathology characteristics, and indefinite differences in appearance between the cancerous and healthy tissue lead to many false positive and false negative interpretations. Hence, this research introduces a comprehensive framework designed to achieve accurate identification and localization of prostate cancers, irrespective of their aggressiveness. This is accomplished through the utilization of a sophisticated multilevel bidirectional long short-term memory (Bi-LSTM) model. The pre-processed images are subjected to multilevel feature map-based U-Net segmentation, bolstered by ResNet-101 and a channel-based attention module that improves the performance. Subsequently, segmented images undergo feature extraction, encompassing various feature types, including statistical features, a global hybrid-based feature map, and a ResNet-101 feature map that enhances the detection accuracy. The extracted features are fed to the multilevel Bi-LSTM model, further optimized through channel and spatial attention mechanisms that offer the effective localization and recognition of complex structures of cancer. Further, the framework represents a promising approach for enhancing the diagnosis and localization of prostate cancers, encompassing both indolent and aggressive cases. Rigorous testing on a distinct dataset demonstrates the model's effectiveness, with performance evaluated through key metrics which are reported as 96.72%, 96.17%, and 96.17% for accuracy, sensitivity, and specificity respectively utilizing the dataset 1. For dataset 2, the model achieves the accuracy, sensitivity, and specificity values of 94.41%, 93.10%, and 94.96% respectively. These results surpass the efficiency of alternative methods.

15.
Front Neurosci ; 18: 1308370, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38476869

RESUMEN

Introduction: Electronic Sports (eSports) is a popular and still emerging sport. Multiplayer Online Battle Arena (MOBA) and First/Third Person Shooting Games (FPS/TPS) require excellent visual attention abilities. Visual attention involves specific frontal and parietal areas, and is associated with alpha coherence. Transcranial alternating current stimulation (tACS) is a principally suitable tool to improve cognitive functions by modulation of regional oscillatory cortical networks that alters regional and larger network connectivity. Methods: In this single-blinded crossover study, 27 healthy college students were recruited and exposed to 10 Hz tACS of the right frontoparietal network. Subjects conducted a Visual Spatial Attention Distraction task in three phases: T0 (pre-stimulation), T1 (during stimulation), T2 (after-stimulation), and an eSports performance task which contained three games ("Exact Aiming," "Flick Aiming," "Press Reaction") before and after stimulation. Results: The results showed performance improvements in the "Exact Aiming" task and hint for a prevention of reaction time performance decline in the "Press Reaction" task in the real, as compared to the sham stimulation group. We also found a significant decrease of reaction time in the visual spatial attention distraction task at T1 compared to T0 in the real, but not sham intervention group. However, accuracy and inverse efficiency scores (IES) did not differ between intervention groups in this task. Discussion: These results suggest that 10 Hz tACS over the right frontal and parietal cortex might improve eSports-related skill performance in specific tasks, and also improve visual attention in healthy students during stimulation. This tACS protocol is a potential tool to modulate neurocognitive performance involving tracking targets, and might be a foundation for the development of a new concept to enhance eSports performance. This will require however proof in real life scenarios, as well optimization.

16.
Artif Intell Med ; 149: 102782, 2024 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-38462283

RESUMEN

Diabetic retinopathy (DR) is the most prevalent cause of visual impairment in adults worldwide. Typically, patients with DR do not show symptoms until later stages, by which time it may be too late to receive effective treatment. DR Grading is challenging because of the small size and variation in lesion patterns. The key to fine-grained DR grading is to discover more discriminating elements such as cotton wool, hard exudates, hemorrhages, microaneurysms etc. Although deep learning models like convolutional neural networks (CNN) seem ideal for the automated detection of abnormalities in advanced clinical imaging, small-size lesions are very hard to distinguish by using traditional networks. This work proposes a bi-directional spatial and channel-wise parallel attention based network to learn discriminative features for diabetic retinopathy grading. The proposed attention block plugged with a backbone network helps to extract features specific to fine-grained DR-grading. This scheme boosts classification performance along with the detection of small-sized lesion parts. Extensive experiments are performed on four widely used benchmark datasets for DR grading, and performance is evaluated on different quality metrics. Also, for model interpretability, activation maps are generated using the LIME method to visualize the predicted lesion parts. In comparison with state-of-the-art methods, the proposed IDANet exhibits better performance for DR grading and lesion detection.


Asunto(s)
Diabetes Mellitus , Retinopatía Diabética , Adulto , Humanos , Retinopatía Diabética/diagnóstico por imagen , Retinopatía Diabética/patología , Redes Neurales de la Computación , Interpretación de Imagen Asistida por Computador/métodos
17.
bioRxiv ; 2024 Jan 23.
Artículo en Inglés | MEDLINE | ID: mdl-38328139

RESUMEN

When analyzing complex scenes, humans often focus their attention on an object at a particular spatial location. The ability to decode the attended spatial location would facilitate brain computer interfaces for complex scene analysis (CSA). Here, we investigated capability of functional near-infrared spectroscopy (fNIRS) to decode audio-visual spatial attention in the presence of competing stimuli from multiple locations. We targeted dorsal frontoparietal network including frontal eye field (FEF) and intra-parietal sulcus (IPS) as well as superior temporal gyrus/planum temporal (STG/PT). They all were shown in previous functional magnetic resonance imaging (fMRI) studies to be activated by auditory, visual, or audio-visual spatial tasks. To date, fNIRS has not been applied to decode auditory and visual-spatial attention during CSA, and thus, no such dataset exists yet. This report provides an open-access fNIRS dataset that can be used to develop, test, and compare machine learning algorithms for classifying attended locations based on the fNIRS signals on a single trial basis.

18.
Neuroscience ; 542: 59-68, 2024 Mar 26.
Artículo en Inglés | MEDLINE | ID: mdl-38369007

RESUMEN

Brain Computer Interface (BCI) is a highly promising human-computer interaction method that can utilize brain signals to control external devices. BCI based on functional near-infrared spectroscopy (fNIRS) is considered a relatively new and promising paradigm. fNIRS is a technique of measuring functional changes in cerebral hemodynamics. It detects changes in the hemodynamic activity of the cerebral cortex by measuring oxyhemoglobin and deoxyhemoglobin (HbR) concentrations and inversely predicts the neural activity of the brain. At the present time, Deep learning (DL) methods have not been widely used in fNIRS decoding, and there are fewer studies considering both spatial and temporal dimensions for fNIRS classification. To solve these problems, we proposed an end-to-end hybrid neural network for feature extraction of fNIRS. The method utilizes a spatial-temporal convolutional layer for automatic extraction of temporally valid information and uses a spatial attention mechanism to extract spatially localized information. A temporal convolutional network (TCN) is used to further utilize the temporal information of fNIRS before the fully connected layer. We validated our approach on a publicly available dataset including 29 subjects, including left-hand and right-hand motor imagery (MI), mental arithmetic (MA), and a baseline task. The results show that the method has few training parameters and high accuracy, providing a meaningful reference for BCI development.


Asunto(s)
Interfaces Cerebro-Computador , Espectroscopía Infrarroja Corta , Humanos , Espectroscopía Infrarroja Corta/métodos , Redes Neurales de la Computación , Algoritmos , Corteza Cerebral/diagnóstico por imagen , Mano , Electroencefalografía/métodos , Imaginación
19.
Phys Med Biol ; 69(7)2024 Mar 18.
Artículo en Inglés | MEDLINE | ID: mdl-38306971

RESUMEN

Objective. Celiac disease (CD) has emerged as a significant global public health concern, exhibiting an estimated worldwide prevalence of approximately 1%. However, existing research pertaining to domestic occurrences of CD is confined mainly to case reports and limited case analyses. Furthermore, there is a substantial population of undiagnosed patients in the Xinjiang region. This study endeavors to create a novel, high-performance, lightweight deep learning model utilizing endoscopic images from CD patients in Xinjiang as a dataset, with the intention of enhancing the accuracy of CD diagnosis.Approach. In this study, we propose a novel CNN-Transformer hybrid architecture for deep learning, tailored to the diagnosis of CD using endoscopic images. Within this architecture, a multi-scale spatial adaptive selective kernel convolution feature attention module demonstrates remarkable efficacy in diagnosing CD. Within this module, we dynamically capture salient features within the local channel feature map that correspond to distinct manifestations of endoscopic image lesions in the CD-affected areas such as the duodenal bulb, duodenal descending segment, and terminal ileum. This process serves to extract and fortify the spatial information specific to different lesions. This strategic approach facilitates not only the extraction of diverse lesion characteristics but also the attentive consideration of their spatial distribution. Additionally, we integrate the global representation of the feature map obtained from the Transformer with the locally extracted information via convolutional layers. This integration achieves a harmonious synergy that optimizes the diagnostic prowess of the model.Main results. Overall, the accuracy, specificity, F1-Score, and precision in the experimental results were 98.38%, 99.04%, 98.66% and 99.38%, respectively.Significance. This study introduces a deep learning network equipped with both global feature response and local feature extraction capabilities. This innovative architecture holds significant promise for the accurate diagnosis of CD by leveraging endoscopic images captured from diverse anatomical sites.


Asunto(s)
Enfermedad Celíaca , Humanos , Enfermedad Celíaca/diagnóstico por imagen , Endoscopía
20.
Sensors (Basel) ; 24(4)2024 Feb 06.
Artículo en Inglés | MEDLINE | ID: mdl-38400207

RESUMEN

In recent years, the development of image super-resolution (SR) has explored the capabilities of convolutional neural networks (CNNs). The current research tends to use deeper CNNs to improve performance. However, blindly increasing the depth of the network does not effectively enhance its performance. Moreover, as the network depth increases, more issues arise during the training process, requiring additional training techniques. In this paper, we propose a lightweight image super-resolution reconstruction algorithm (SISR-RFDM) based on the residual feature distillation mechanism (RFDM). Building upon residual blocks, we introduce spatial attention (SA) modules to provide more informative cues for recovering high-frequency details such as image edges and textures. Additionally, the output of each residual block is utilized as hierarchical features for global feature fusion (GFF), enhancing inter-layer information flow and feature reuse. Finally, all these features are fed into the reconstruction module to restore high-quality images. Experimental results demonstrate that our proposed algorithm outperforms other comparative algorithms in terms of both subjective visual effects and objective evaluation quality. The peak signal-to-noise ratio (PSNR) is improved by 0.23 dB, and the structural similarity index (SSIM) reaches 0.9607.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...